In silico prediction of aqueous solubility: a multimodel protocol based on chemical similarity.
Abstract
Aqueous solubility is one of the most important ADMET properties to assess and optimize during the drug discovery process. At present, accurate prediction of solubility remains very challenging, and there is an important need for independent benchmarking of existing in silico models in order to suggest ways to improve them. In this study, we developed a new protocol for improved solubility prediction by combining several existing models available in commercial or free software packages. We first evaluated ten in silico models for aqueous solubility prediction on several data sets in order to assess the reliability of the methods, and we proposed a new diverse data set of 150 molecules, SolDiv150, as a relevant test set. We developed a random forest protocol to evaluate the performance of different fingerprints for aqueous solubility prediction based on molecular structure similarity. Our protocol, called a "multimodel protocol", selects the most accurate model for a compound of interest among the employed models or software packages, achieving an r² of 0.84 when applied to SolDiv150. We also found that all models assessed here performed better on druglike molecules than on real drugs; additional improvement is therefore needed in this direction. Overall, our approach enlarges the applicability domain, as demonstrated by the more accurate solubility predictions obtained with our protocol than with individual models.
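The idea of choosing, per compound, the model that performs best on structurally similar molecules can be illustrated with a minimal Python sketch. This is not the authors' protocol: the abstract describes a random forest protocol over several fingerprint types, whereas the sketch below swaps in a plain nearest-neighbour lookup with Tanimoto similarity. Fingerprints are represented as sets of "on" bit indices, and the function names, model names, and all numeric values are hypothetical, chosen only for illustration.

```python
from typing import Dict, FrozenSet, List, Tuple

Fingerprint = FrozenSet[int]  # indices of the "on" bits of a structural fingerprint
# One reference compound: (fingerprint, measured logS, {model name: predicted logS})
Reference = Tuple[Fingerprint, float, Dict[str, float]]


def tanimoto(a: Fingerprint, b: Fingerprint) -> float:
    """Tanimoto (Jaccard) similarity between two bit sets."""
    if not a and not b:
        return 0.0
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def select_model(query: Fingerprint, reference: List[Reference], k: int = 3) -> str:
    """Return the model with the lowest total absolute error on the
    k reference compounds most similar to the query compound."""
    neighbours = sorted(
        reference, key=lambda entry: tanimoto(query, entry[0]), reverse=True
    )[:k]
    errors: Dict[str, float] = {}
    for _, measured, predictions in neighbours:
        for model, predicted in predictions.items():
            errors[model] = errors.get(model, 0.0) + abs(predicted - measured)
    return min(errors, key=errors.get)


# Made-up reference set: two structural families, each predicted well
# by a different (hypothetical) model.
reference: List[Reference] = [
    (frozenset({1, 2, 3, 4}), -2.0, {"model_a": -2.1, "model_b": -3.0}),
    (frozenset({1, 2, 3, 5}), -2.5, {"model_a": -2.4, "model_b": -3.5}),
    (frozenset({10, 11, 12}), -5.0, {"model_a": -3.0, "model_b": -5.1}),
    (frozenset({10, 11, 13}), -4.5, {"model_a": -2.5, "model_b": -4.4}),
]

print(select_model(frozenset({1, 2, 3}), reference, k=2))   # model_a
print(select_model(frozenset({10, 11}), reference, k=2))    # model_b
```

In this toy setting the selected model changes with the structural neighbourhood of the query, which is the sense in which a per-compound selection can widen the applicability domain compared with any single model.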
Related articles
Accurate Solubility Prediction with Error Bars for Electrolytes: A Machine Learning Approach
Accurate in silico models for predicting aqueous solubility are needed in drug design and discovery and many other areas of chemical research. We present a statistical modeling of aqueous solubility based on measured data, using a Gaussian Process nonlinear regression model (GPsol). We compare our results with those of 14 scientific studies and 6 commercial tools. This shows that the developed ...
Correlation and Prediction of Solubility of CO2 in Amine Aqueous Solutions
The solubility of CO2 in the primary, secondary, tertiary and sterically hindered amine aqueous solutions at various conditions was studied. In the present work, the Modified Kent-Eisenberg (M-KE), the Extended Debye-Hückel (E-DH) and the Pitzer models were employed to study the solubility of CO2 in amine aqueous solutions. Two explicit equations are presented to evalu...
Correlation and Prediction of Acid Gases Solubility in Various Aqueous Alkanolamine Solutions Using Electrolyte Cubic Square-Well Equation of State
The objective of this work is the correlation and prediction of the solubility of CO2 and H2S in various aqueous alkanolamines using the electrolyte cubic square-well equation of state (eCSW EoS) (Haghtalab, A., Mazloumi, S. H. (2010), Electrolyte Cubic Square-Well Equation of State for Computation of the Solubility CO2 and H2S in Aqueous MDEA Solutions, Ind. Eng. Chem. Res., 49, 6221-623). The eEoS systemati...
Prediction of the pharmaceutical solubility in water and organic solvents via different soft computing models
Solubility data for solids in aqueous and organic solvents are important physicochemical properties considered in the design of industrial processes and in theoretical studies. In this study, experimental solubility data of 666 pharmaceutical compounds in water and 712 pharmaceutical compounds in organic solvents were collected from different sources. Three different artificia...
Solubility Prediction of Anthracene in Non-Aqueous Solvent Mixtures Using Jouyban-Acree Model
A quantitative structure-property relationship was proposed to calculate the binary interaction terms of the Jouyban-Acree model using the solubility parameter, boiling point, vapour pressure and density of the solvents. The applicability of the proposed method for reproducing solubility data of anthracene in binary solvents has been evaluated using 116 solubility data sets collected from the lite...
Journal title:
- Molecular pharmaceutics
Volume 9, Issue 11
Pages: -
Publication date: 2012